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ABSTRACT 

A brief history of the development of tables of random numbers is 
presented. This is followed by an investigation of the generation of 
uniformly distributed random numbers on the GDC 160ii digital computer. 
Certain congruential relations are examined in order to determine 
parameters that will pi'oduce acceptable random number sequences. Sta- 
tistical tests of the generated numbers are described and the results 
of these tests are presented. 
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PREFACE 



Statistical sampling techniques using a "Monte Carlo" method can 
require a large supply of random numbers. Such techniques have been used 
in the evaluation of definite integrals, the solution of ordinary and 
partial differential equations, the solution of integral equations and 
matrix inversion, to mention only a few applications. On the other hand, 
some problems, described in terms of a stochastic model, can only be 
solved by recourse to sampling techniques, as for example, mathematical 
v;ar gaming problems. 

When such models are programmed for a high speed digital computer, 
it is necessary to produce large numbers of random numbers quickly. The 
tables of random digits or random uniform numbers customarily used in 
hand calculations do not lend themselves to use in the high speed compu- 
ter. It is inefficient to read into the computer the large number of 
random digits required to meet the demand for such numbers arising in 
problems which are otherwise within the capabilities of the machine. 

Using the same numbers repeatedly in a given problem would introduce 
bias. 

It has been suggested that special equipment which utilizes a phys- 
ical random process be used to produce the random numbers. On the sur- 
face, this would seem desirable but, in practice, difficulties immedi- 
ately arise. First, the construction and maintenance of such a device 
would be quite expensive. To be useful, it would be required to produce 
a number in the order of a few microseconds for extended periods. Reli- 
ability requirements would necessitate its output to be continually 
checked for randomness. The second difficulty is that such a random 
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device would not permit the recalculation of a problem using the same 
generated numbers. This could prove to be a handicap in checking on 
machine operations. 

These difficulties lead to the exploration of arithmetical schemes 
for generating numbers which take advantage of the computer’s high speed 
and place minimum demands on computer memory storage. Such numbers 
could not be considered truly random j hence, the adoption of the term 
"pseudo-random numbers" . 

It is the intent of this paper to present a brief historical devel- 
opment of random number generation and to explore, in particular, the 
multiplicative congruential type of generator for adaptation to the 
GDC I60i^. 
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I . INTRODUCTION 



1, Kendall and Smith's Random Sampling Numbers. 

It might be supposed that if one wanted to obtain a sequence of 
random digits that almost any method that employed some sort of chaotic 
selection would suffice. Observing the indications of a roulette wheel 
or, perhaps, an unsystematic selection of digits from a large telephone 
directory might seem like "reasonable" methods. Once the digits were 
collected, however, a problem would arise. By what criterion could 
their randomness be judged? 

It was to this problem that M. G. Kendall and B. Babington Smith 

addressed themselves in their now classic work, "Randomness and Random 

r i 1 

Sampling Numbers". !lj They start their paper with a discussion on 

the relation between randomness and probability. 

In colloquial speech the word "random" is applied to any method 
of choice which lacks aim or purpose j and this usage is also found 
in certain sciences. In statistics, however, the word has a some- 
what different and more definite significance, closely related to 
probability. It appears, in fact, that for statistical purposes 
the ideas of randomness and probability are inseparable, vAiether 
one belongs to the "intuitive" school which regards probability as 
an undefinable, or to the opposing "frequency" school which seeks 
to explain it in terms of statistical frequencies. 

Having linked the concepts of probability and randomness, Kendall 

and Smith concede that there are many controversial areas that pose 

abstract problems that are, — "verging, at times, on the theological." 

In the interest of getting on with the more mundane problems of Random 

Sampling Numbers, they state: 

We take randomness and probability to be undefined ideas obeying 
certain intuitively formulated principles, xjhich will be found suf- 
ficient to give results of practical application. 



■“■Numbers in brackets refer to references cited in the Bibliography at 
the end of the paper. 
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The authors note that any combination of digits could be considered 
a random set, in that all combinations are possible when drawing from an 
infinite universe of digits. However, not all combinations are practical 
for random sampling; for example, a block of digits which are all the 
same. For purposes of sampling, a definition of Random Sampling Number 
is made: a set of numbers which can be used for random sampling, not 

necessarily a set obtained by random methods. It is Kendall and Smith's 
intention to produce a table of random digits and to test these for 
acceptability as Random Sampling Numbers. That is, Random Sampling Num- 
bers is a collective term for a series of random digits. 

Such a set of Random Sampling Numbers should conform to the notion 
of relative frequency. That is, if samples were taken from a universe 
consisting of an infinite number of digits such that each of the ten 
digits 0 through 9 may be expected to appear with the same frequency, 
then, it is expected that in a large number of trials each of the digits 
will appear approximately an equal number of times. Further, it would 
be expected that each pair of digits would occur approximately the same 
number of times, similarly triples, etc.; that is, no digit would tend 
to follow another digit in any consistent pattern. 

Here then is an important distinction: any given set of N numbers 

need not follow such expectations but a set of Random Sampling Numbers 
should do so. Hereafter, we shall identify such a set of Random Samp- 
ling Numbers as a random set. 

In order to ascertain whether a given set of digits is a good enough 
"approximation" to a random set, the observed frequencies may be compared 
to the theoretical frequencies by means of a chi-square test. On ''purely 
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arbitrary grounds" , Kendall and Smith elected to say that a set of num- 
bers is acceptable if the resulting chi-square statistic is within the 
range of 0,01 — P — 0.99, where, with 0^ the observed and Ej the 
expected frequencies of the jth digit, 



and 



= I 

j = o 



(°1 - 



P 




o 





dX 




9-1 

X dX 



with 9 being the number of degrees of freedom. 

Having thus determined the characteristics of this set, the authors 
then proceed to establish a series of four tests for a set of digits to 
be considered as a random set. Subsequently, these tests have become 
standard in the literature on random numbers. Briefly, they are as fol- 
lows: 

(a) The frequency test to determine if all the digits of a set 
occur in approximately equal amounts. 

(b) The serial test which tests to see if any digit tends to 
follow another. 

(c) The poker test which is named after the card game and examines 
the digits in blocks of five for five of a kind, four of a kind 
etc. The frequency of these hands are then compared with the 
expected distribution of such hands. 
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(d) The gap test which inquires into the gaps occurring between the 



same digits in a series and compares this with the theoretical 
distribution. 

Although these tests are not sufficient, the authors feel that collec- 
tively they are powerful. 

V/ith the foregoing criteria established, Kendall and Smith attempt 
to set up a random series by selecting digits from the London Telephone 
Directory in a haphazard fashion. Even though they took what would seem 
like extra precautions to avoid bias, the results of their tests indi- 
cated significant bias. 

They then constructed an electro-mechanical device to produce random 
digits. Essentially, it consisted of a wheel divided into ten equal seg- 
ments with a digit assigned to each segment. It was driven by an elec- 
tric motor whose speed was accurately controlled. The operator on a 
separate circuit controlled a neon lamp whose flash was of such a short 
duration that the moving wheel appeared to be stationary. The number 
that appeared under a previously fixed pointer was recorded as the ran- 
dom digit. The actuation of the lamp was kept haphazard by having the 
operator move a stylus over an intricate maze of circuitry which, it was 
maintained, caused the light to operate in a random fashion. 

A table of 100,000 digits was produced in this fashion and, based 
on their tests, were considered acceptable as Random Sampling Numbers j^lj. 

2. RANT) Corporation's Table of Random Digits. 

Kendall and Smith's table appears to have been satisfactory for a 
number of yearsj at least, there were no other tables published. In 
19^7} a group of research workers in the RAN'D Corporation responded to a 



growing need for random numbers by producing a table of one million ran- 
dom digits ! 2j . These were needed to solve various computerized 
research problems by experimental probability procedures , Such proce- 
dures were called by the colorful name, Monte Carlo Methods o For man?/ 
of the problems that RAIID was working on, the need for numbers far 
exceeded those of Kendall and Smith's Tables. In fact, it meant that 
these tables would have to be used time and again in a particular prob- 
lem, This presented the consequent danger of introducing unwanted cor- 
relations. Such a grovjing thirst for random numbers was, of course, due 
to new problems and rapidly developing computer technology available for 
solving the se problems . 

RAND'S table was produced by electronic-mechanical means and was 

apparently more sophisticated than Kendall and Smith's machine. 

In principle the machine was a 32 -place roulette wheel which made, 
on the average, about 3,000 revolutions per trial and produced 
one number per second. 

It appears that the engineers designing the device did have many 
problems because the original machine showed statistically significant 
biases and had to be modified extensively. Further, even though elec- 
tronic checks were continually made, after a month the results of tests 
of a block of numbers produced showed significant bias. This indicated 
that the machine was running dovm even though the electronic checks were 
acceptable . 

3. Pseudo-random Mxunber Generation, 

While RAND'S table was suitable for work with punched card compu- 
ters, it proved impractical vfith the advent of high speed electronic 
computers because: 



(a) it placed severe storage requirements on the machine 

(b) reading in the tables was slow 

(c) much larger tables were required for the solution of certain 
types of problems. 

These shortcomings led to an interest in methods for producing 
"pseudo-random" numbers by arithmetic means. K definition of this tern 
was given by D, H. Lehmer as: 

A pseudo-random sequence is a vague notion embodying the idea of 
a sequence in which each term is unpredictable to the uninitiated 
and v;hose digits pass a certain number of tests traditional with 
statisticians and depending somewhat on the uses to which the 
sequence is to be put. 

One of the first methods used was proposed by von Neumann and Metro- 
polis wherein an arbitrary n-digit number, Yq , was selected. This num- 
ber was then squared and was produced as an n-digit number from the 

2 

center of the 2 n-digits of Yq . The process was then repeated, produc- 
ing Yj^+]^ from Y^. 



Although this process generated pseudo-random numbers, its period 



was not predictable. Further, it was demonstrated by D, H. Lehmer 
that, in general, it would not produce a very long cycle before the 
process degenerated into a sequence of zeros. Consequently, it could 
not be recommended as a source of great quantities of random numbers. 
During the second symposium at the Harvard Computation Laboratory, 



D. H. Lehmer suggested 
tion (See Appendix A) 



that the multiplicative congruential rela- 



Vl = ^ ^r (1-1) 

could be used where the least positive residue is taken as the generated 
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number. In effect, this statement says that ( ^ ^ ^ is an 

integer. The modulus M could be selected so as to be compatible with 
the base of the machine being used; that is, it could be of the form 
(2^) or (10^) . P normally is selected to be the number of positions 
available in a word of the computer being used. Further, in a binary 
machine, with a proper selection of L, the period of this generator is 
2(P-2). 

Using P in this way avoids the necessity for performing the final 
division operation in most high speed computers. For example, in the 
GDC l6oU using a modulus of 2^"^, the multiplication ( KUI ) of the number 
L by Yj. positions the product in the Qk registers. The U? low-order 
bits of this product in the A register are, in fact, the desired resid- 
ual or the nevrly generated random number. 

Although such a method is relatively fast, nevertheless, it does 
use the multiplication command which, for a computer, is time consuming. 



Greenberger 



proposed that L in equation (1-1) be modified to 



= ( 2^ + 3 ) 



(mod 2^) 



(1-2) 



with A — 3. By doing this, the multiplication could be performed by a 

shift command and 3 add commands which represented a considerable saving 

in time over the straight multiplication instruction. Further, he shows 

1 ^ 5 ] that an L of this form does satisfy the conditions necessary to 

P-2 

insure the full period of 2 

Another variation of equation (1-1) was proposed by Rotenberg 
whose generator became 



“■r+l 



( 2^ + 1 ) Yj. + C 



(mod 2^) 



(1-3) 
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This again was faster than Greenberger ' s model by one add instruc- 
tion but, unlike the former, this gave a full period of 2 for A — 2 
and C an odd integer. 

Equations (1-2) and (1-3) will be investigated in Chapters II and 
III for use as pseudo-random number generators for the CDC 160U computer. 
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II. EMPIRICAL TEST PROCEDURES 

1. General 

Previous studies made using other machines 5j 6 j^7 

that the quality of the generated pseudo-random numbers using equations 
(1-2) and (1-3) can be very sensitive to the selection of A. Aside from 
a guarantee of the period length (See Appendix A), there is, presently, 
little mathematical theory to aid in the selection of parameters that will 
enhance the random attributes of the generated samples. The task is 
largely an empirical one wherein an A and a starting value, Y^, are se- 
lected and the subsequent output is submitted to a battery of tests. 

Such empirical analysis can only give general indications as to desirable 
parameters because it is only practical to investigate a small portion 
of the total cycle of the generator. This fact can be appreciated by 
observing that using a modulus of 2^”^ in (1-2) the fraction of the full 

period that would be considered, if even, say 10 elements were tested, 

-3 

is approximately 10 

As has been cited in Chapter I, the tests for randomness offered by 
Kendall and Smith were designed to check on the randomness of the indi- 
vidual digits of their table of random numbers. The generators under 
discussion here, however, produce pseudo-random numbers. Consequently, 
those tests that lend themselves to investigating certain properties 
associated with random numbers rather than those associated with indi- 
vidual digits are favored in the investigative work herein reported. 

Should a requirement arise for random digits when using equations (1-2) 
and (1-3), only the high order (left-most) bit positions of the binary 
representation of the generated number should be considered. (See 
Appendix A ) . 
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2. Uniforinly Distributed Random Variables, 

Many uses of computerized sequences of random numbers require that 
they be uniform 0,1 . Not only is this distribution used in. its own 
right but numbers from this distribution may be used to produce numbers 
with other distributions. Converting the generated pseudo-random numbers 
of equations (1-2) and (1-3) to uniform j^0,l is quickly accomplished in 
a high speed digital computer as it merely requires the positioning of a 
decimal point to the left of the generated numbers. (See Appendix B.) 

3, Description of Tests. 

The following tests were performed on the blocks of numbers forming 
a generated sequence, 

(a) Frequency test: The classic chi-square goodness of fit test 

was used as a frequency test . This test divides the unit interval 

into k equal parts. The generated numbers were each checked and a tally 
kept on their frequency distribution within the k intervals. This is, 
perhaps, the most frequently applied of the random tests. 

Attempts have been made to establish some sort of an optimal size 



8 



, present a procedure by 



for k. Mann and Wald, in their paper 
which the lengths of the class intervals are determined so that the prob- 
ability of each class under the null hypothesis is equal to 1/k where k 
is the number of class intervals. They establish a Theorem which says 
that, in the limit, the best (defined by them) value of k is given by 




d 
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is equal to the size of the critical region (probability of the critical 
region under the null hypothesis) and M is the number of random elements 
generated. For significance levels of ,01 and ,05 the corresponding 
values of C are 2,327 and l,bhS respectively. These, in turn, lead to 
k ^ 88 for ,01 and k^ llU for ,05 with N ^000. A k = 100 was used in 
the large sample tests reported on in Chapter III. 

(b) Serial Correlation; This test was set up following the formula 

given by Kendall . Lags from 1 through 10 were examined. 

(c) Run test; The number of runs up and down were examined and com- 
pared with the expected theoretical distribution using a chi-square test. 
The first nuraber to be examined is compared with the next number in the 
sequence. If the second is larger than the first, a run up of size one 
has occurred but is not yet recorded. The second and third numbers are 
then examined in the same way. If the third is greater than the second, 
a run of size two has occurred but is not yet recorded. If the third is 
smaller than the second, a run of size one is recorded etc, 10 

(d) Moments: The first four central moments were computed and com- 

r 1 

pared with those for the uniform p^l distribution. 

(e) Poker Test; This test was used to examine the sequence of 
digits within a nuraber. Starting at the highest order bit position, the 
digits were grouped to give five octal digits (11? bit positions). The 
poker hand value of these five digits was tallied. The next l5 bit 
positions of the same random nuraber were siiailarly examined and tallied. 
The succeeding generated numbers were examined in a similar fashion. A 
chi-square test was performed on the results. 
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li. Small Sajnple Tests. 



Exploratory tests vjere made on generators (1-2) and (1-3) using two 



loo were taken at each; A = 5,10,l5. . . This was done in order to 

ascertain, in a general way, the sensitivitjr of the generators to 

changes in A . The results presented in Tables 1 - 8 of Chapter III 

indicate that for these particular starting values the selection of A 

is critical. It further confirmed that a good selection of A was approx- 
, 1 

imately ( 2 ) for both generators. This supp 'rted the rule of thumb 



Selection of Yq = 1 also gives an indication of how the generator 
would respond when faced ;>rith a low number. Cases have been reported 



where, when this occurs, an unfortunate selection of A will make 
the generator extremel}’’ sluggish and can produce several hundred low 
numbers before yielding a more chaotic output. This succession of low 
numbers gives, in effect, a sequence all below 0.3. Such a character- 
istic, obviously, is undesired. 

Parallel procedures were followed using a sample size of 1000 num- 
bers at each A. 

3. Large Sample Tests. 

Having selected a value of A from the results of the small sample 
tests, five blocks of 10,000 numbers were generated and subjected to the 
battery of tests. 

V 

6. Modifications. 

Experiments were conducted to investigate the quality of pseudo- 
random numbers generated by addressing the output of one generator to a 



different starting values, Yq = 1 and Yq = 2^^ - 1. Sample sizes of 



that was offered by Greenberger 





• t 



n 
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second one having a different value of A, This linking process for 
equation (1-2) and (1-3) can be described as follows; 



^r 


+ 

C\J 

III 


^r-1 


(mod 2^"^) 




Qr 


E ( 2^ + 1 ) 


1 


(mod 2^^) 


(2-1) 


where Qj,|r = 


1> 2, 




the sequence of interest. 
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III. RESULTS \MU CONCLUSIONS 



1. Interpretation of Data. 

No attempt was made to select the best parameter A based on the 
small sample tests discussed in Chapter II. Rather, the observed data 
was prepared in tabular form for each of the generators. This allows 
general comparisons to be made on the effects of changing the inputs on 
each of the generators being investigated. In this way, certain ranges 
of parameters are observed to produce small samples whose performance is 
consistently poor under the battery of tests. It also permits a compari- 
son of one generator with the other as A is varied. For if one assumes, 
as will be done in this paper, that the generator that exhibits the 
least number of these "poor" indications is the preferred generator 
such a tabulation will make it apparent. This assui p ,ion does not take 
into account the differences in speed of generation between (1-2) and 
(1-3). In the GDC l60lj, this difference is one add instruction or 7.2 
micro-seconds per word and althou^ this is a consideration, for pur- 
poses of this analysis, it will be considered minor. The same can be 
said of the differences in cycle length between the generators under 
consideration because each of them is extremely large. In short, the 
point of focus in the discussion to follow is to be centered on the gen- 
erators ability to produce pseudo-random numbers that are of practical 
use . 

A particular parameter was considered "poor" if it failed to pass 
anyone of the following arbitrarily selected criteria: 

(a) The frequency chi-square statistic, must be 

3.3 — — 16.9 that is, for 9 degrees of freedom the 
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critical values of the chi-square were selected as o05 and 

(b) No long runs; where a long run will be considered a run of 
length six or greater for samples of size 100 and of length seven 
or greater in samples of size 1000. 

(c) Serial correlation coefficients of lag one must be less than 0.2 
for samples of size 100 and 0.05 for samples of size 1000. 

(d) Mean greater than or equal to 0.U30 and less than or equal to 

0 .^ 70 . 

2. Results of Small Sample Tests. 

Tables 1 through 8 indicate the results of varying A in the gener- 
ators 



Y 



' r ( 2^ + 3 ) Y 
r+1 ' ' T 



(mod 2^"^) 



(3-1) 



and 




(mod 2^"^) 



(3-2) 



In tables 1 through k Y^ = 1 while in tables 5 through 8 Y^ = 2^^- 1 . 
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Sample size 100 
Sample size 1000 



In applying the criterion of paragraph (1), the number of values 
of A reflecting a "poor" saraple in the case of (3-2) is greater than 
for (3-1). Generator (3-2) is very susceptible to long runs and, above 
an A of 25, cannot be considered a practical random nuraber generator. 
Using different starting values did not improve the output of (3-2); 
for = 1 only an A * 21 passed and for Yq = 2^^- 1 no value of A 
passed. Even relaxing the criteria so that two failures are allowed 
before an A is considered "poor" only admits A * 18 and A * 20 as being 
acceptable for both starting values. Further, using a value of 
G = (. 788 ) 2^"^ as suggested by Coveyou |loj did not make any notice- 
able improvement in (3-2). '' 

Table 9 shows the acceptable A’s for generator (3-1). 



TABLE 9 



1. 

1 


1 Acceptable A ' s 

j . . _ _ . . . . .. . 


1 1 


10 

\ 


15 


18 


19 21 22 


35 


Uo 


1 2^^-l 

1 

j 


' 10 

1 

1 


16 


19 


21 22 25 


35 


ho U5 



The above test results indicate that for these criteria generator (3-1) 
is much less erratic than (3-2). Six different values of A, 

( A = 10, 19 , 21, 22, 35, bo), are acceptable using both starting values and 
therefore it is suggested that A be chosen from these values. 

Some tests were also made using the approach suggested by equation 
(2-1) . Aside from some improvements in the poker tests results, no 
noticeable improveraentrs were made. Using this approach increases the 
time of generation per word and unless the results registered a 



2U 



significant improvement, it coujd not be economically justified. 

3 . Results of Large Sample Tests. 

Using the findings of parag^-aph (?) above, an A = 19 was selected 
for the five large samples to be tested. This particular value was 
soleeteJ over the faster A = 10 because it appeared to perform slightly 
better than A = 10 in the poker tests. The results are presented in 
Tables 10 through 17 and indicate that the samples satisfactorily pass 
the battery of tests. Figure 1 .shows the average serial correlation of 
the five samples for lags one to ten. Over these lags the correlations 
ranged from +0,0182 to -0.0201. 

H. Remarks on Computer Vtork. 

All of the tests used in this paper were written using FORTRAN 
language. These have been catalogued and are available at the Computer 
Center at the United States Naval Postgraduate School. Programs using 
an A = 19 in Equation (3~1) have been written both for Scrap and for 
Fortran and are available in the computer library. Appendix B lists 
the machine language steps required to produce a random nuinber accord- 
ing to (3-1) and the additional steps required to convert it to a 
floating point fraction between zero and one. These steps have been 
included to allow a programmer a little more flexibility in using the 
generator. In many cases, it may be easier to include these steps 
directly in a routine rather than go to a computer library subroutine. 

5 . Conclusions . 

(1) To determine a suitable value of A in (3-1) the methods used 
in small sample testing are useful. Further refinements could be 
made by 
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TABLE 10 



FREQUENCY TEST 
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1 


#1 


#2 








y2 

^9 


3.39 


11.66 


5 . '46 


13.36 


7 .U 1 


: PROBABILITY^^ 
INTERVAL 


(.90,. 95) 


(.20, .30) 


(.30,. 70) 


(.10, .20) 


I 

(. 50 , . 70 ) 


99 


102. Uii 


91.98 


75 . 3U 


119.52 


9ii.O 


PROBABILITY 

INTERVAL 


(.50, .25) 


(.50, .75) 


(.95,. 975) 


(. 05 , . 10 ) 


(. 50 , .75) 


■*''(c,d) implie 


s c - P ! X^ 


^0 : 


-=: d where X^ 


2 

is the observed 


value of the 
.90 P 1” X 


chi-square 
^ =- 3.39 - 


statistic 

.95. 


. For example 


, (. 90 , .95) 


implies 



TABLE 11 
RUN TEST 



RUN 

SIZE 


NUMBER OBSERVED 


’ 1 

i 

EXPECTED 1 


#1 


n 


B 


//l4 


B 

..J 


NUMBER 


1 


UIO 5 


14210 


UI 66 


U 228 


UloU 


I 4 I 66.78 


2 


1806 


1780 


1822 


1829 


l85l 


1 1833.09 


3 


5U5 


528 


558 


523 


5o6 


1 527.65 


I 4 


121 


127 


102 


113 


1U3 


i 115 . OI 4 

1 1 


5 ^ 


25 


26 


19 


16 


18 


j 20.33 


6 


6 


1 


2 


2 


2 


i 3.8 

\ 
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TABLE 12 





RUN 


TEST CHI-SQUARE RESULTS 
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ff2 


/^3 


//U 


#5 
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h,22 


3.59 


3.60 


2. he 


9.U6 


PROBABILITY 












INTERVAL 


(.30, .50) 


(.30,. 50) 


(.30,. 50) 


■ (.50, .70) 


(.05,. 10) 



TABLE 13 

POKER TEST (FIRST FIVE OCTAL DIGITS) 



POKER 

HAND 


#1 


#2 


#3 




#5 


EXP. 

NUMBER 


BUST 


200h 


2027 


2000 


2010 


1979 


2050.78 


ONE PAIR 


5073 


5091 


5109 


5205 


5163 


5126,95. 


Tl'JO PAIR 


1623 


1597 


1578 


lh?5 


1616 


1538.09 


THREE OF A KIND 


1035 


1057 


1050 


10U6 


991 


1025.39 


FULL HOUSE 


167 


lh5 


168 


165 


168 


170.90 


FOUR OF A KIND 


96 


81 


92 


98 


83 


85.h5 


FIVE OF A KIND 


2 


2 


3 


1 


0 


2.hh 
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T'iBLE Ih 

poker test chi-square results (first five octal DIGITS) 
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#3 




#5 


2 


5.75 


7.68 


3.59 


6.61 


8.18 


PROBABILITY 

INTERVAL 


(.30, .50) 


(.10,. 20) 


(.50, .70) 


(.20, .30) 


(.10, .20) 



TABLE 15 

CHARACTERISTICS OF GEi'JERATED UNIFORM DISTRIBUTION 
(FIVE SAMPLES OF 10,000) 





1 


2 


3 




5 


EXP. 

VALUE 


MEAN 


.50103 


.50581; 


.50076 


.50358 


.50027 


0.50000 


VARIANCE . 


.08339 


.08U813 


.08357 


.08306 


.081;77 


0.83333 


SKEVn^SS 


.00029 


.00073 


.OOOlli 


.00002 


.00001 


0.00000 


KURTOSIS 


.01255 


.01288 


.012U6 


.01238 


.012781 


0.01250 
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TABLE 16 



POKER TEST (SECOND FIVE OCTAL DIGIT POSITIONS) 



1 

POKER 

HAND 


#1 


#2 


#3 




#0 


EXP. 1 

NU1‘IBER ! 


BUST 


2005 


2003 


2060 


1997 


2081 


2000.78 


ONE PAIR 


0116 


01Oii 


0106 


011+8 


0128 


0126,90 


Tl'JO PAIR 


1039 


1060 


1020 


1061 


1030 


1038.09 


THREE OF A KIND 


1029 


1008 


1011 


1032 


990 


1020.39 


FULL HOUSE 


168 


192 


166 


160 


167 


170.90 


FOUR OF A KII^ID 


89 


79 


70 


101 


89 


80.1+0 


FIVE OF A KIND 


2 


h 


7 


1 


0 


2. Li} 



*Four and Five of a Kind combined for ^ d, of f. 
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POKER TEST CHI-SQUARE RESULTS (SECOND FIVE OCTAL DIGIT POSITIONS) 



! 

1 

i 

1 


//I 


#2 




;fl+ 


#0 


2 

’'5 


0.18 


0.982 


1.32 


1+.80 


1.86 


i PROBABILITY 
1 INTERVAL 


^.99 


(.98-. 90^ 


(.90-. 90) 


(.00-. 30) 


(90-.80) 
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(a) selecting a fev; more starting values, Yq, and tabulating 
the results as in Tables 1 through 

(b) modifying the criterion used in the sifting process to 
meet an individual's unique demands. For example, if he 
desired an unbiased sequence of uniforms, he could tighten 
the restriction on the acceptability limits of the mean. 

(2) Based on the test results. Equation (3-1) is to be preferred 
to (3-2) as a generator of random numbers. The speed advantage in (3-2) 
of about 7.2 micro-seconds per word is masked by its demonstrated ten- 
dency toward long runs and poor frequency distributions for most 
values of A. 

(3) Five blocks of 10,000 numbers each were tested using gener- 
ator (3-1) with A = 19. All tests were passed satisfactorily and it 
is concluded that 

1 = ( 2^9 + 3 ) (mod 2^^) (3-3) 

will perform as a pseudo-random generator for the GDC l60i;. 

(U) The chaining of generators as suggested in E.^uation (2-1) did 
not lead to any positive indications that a better sequence of pseudo- 
random numbers would result. Unless this can be shown, the additional 
time of the generation is not warranted. 
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APPEi'JDIX A 



GEWERAT :)R CHARACTERI3T ICS 

1. General. 

In 19h9, Lehmer suggested using the multiplicative congruen- 

tial method for generating pseudo-random numbers on a digital computer. 
Since that time, this method has been used, tested and reported on by a 



number of investigations, for example |12| 
method on a binary computer of word size P, the recurrence relation most 
frequently takes the form 



13 



In using this 



'fr+l = ’'"r 



(mod 2 ‘ ) 



(A-1) 



p 

where L is a fixed odd integer and O-^Y^ —2 -1 being relatively prime 
to 2^. This sequence is periodic, its length depending on the choice of 
L, Yj, and P. VJhen the indicated iteration is performed, say n times, 
(1-1) becomes; 

^r+n ^ 2^) 

and for this reason is sometimes referred to as the power residue method. 

The numbers Yr/^P (r = 0,1,2 ... . ) become the desired uniformly dis- 

tributed elements from the unit interval. Barnett ilUj in 1961 showed 

that necessary and sufficient conditions to obtain a maximum cycle of 
P-2 

2 elements, all of xirhich are distinct, are: 



(a) L = 3 (laod 8) or L = (mod P) (A-2) 

(b) Y,.,= 1 (mod 2), (A-3) 

that is, Y, must be odd. 

If these conditions are not met the cycle length can be affected 
and in some cases the process degenerates to zero. 
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2. Discussion of Equation (1-2). 

Vvliile the sequence produced by (1-1) has been found satisfactory, 
nevertheless, it does imply the use of the multiplication instruction in 
the computer. Relatively speaking, this is a time-consuming operation 
and interest continued on methods that would avoid its use and still pro- 
duce the desired sequence. Greenberger jlij proposed using an L of the 

A 

form (2 +3) where 2^A<P. It is seen that such an L satisfies 

p_2 

the conditions necessary to insure a cycle of 2' . This fomi has the 

advantage of substituting a shift operation of size A and three add 
instructions for the multiplication operation. In the GDC l60li, the 
multiplication instruction 1^ requires 2^.2 plus .8n micro-sounds; 
\jhere n is the number of ones in the multiplier. On the other hand, the 
shift instruction takes 2.8 plus .lis micro-sounds, where s is the number 
of places shifted and an add instruction takes 7.2 micro-sounds. In 
operations where large quantities of random numbers are required this 
saving can make a significant contribution toward reducing machine time. 

When this form of L is translated to the GDC loOli, equation (1-1) 
becomes: 



Yr+i ^ (2^ + 3) Yr 



(mod 2^'^) 



(A-2) 



where with 2 — A ^ h7 the period of the sequence is 2^^, that is, 

2^^ distinct numbers are produced before the sequence repeats. Each of 
the numbers produced are odd and belongs to one of two mutually exclu- 
sive integer sets depending on the selection of Y^ where, if r belongs 

to one set r+2 cannot. These two sets between them exhaust all of the 

h7 

possible odd numbers in the interval 1 to 2 . Because the numbers are 

odd this means that the two least significant bit positions in the 
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binary representation of a number will not change, that is, their cycle 
is zero. The third bit position has period 2^, the fourth 2^ and so on. 
For this reason, if random digits are to be used only the most signifi- 
cant digits of a number should be considered. 

No mathematical rules have been developed to accurately determine a 
particular choice of A although it is clear that an unfortunate selection 
here can result in an unacceptable generator. Coveyou jl!^ did offer 
a criterion for selecting A in terms of an approximation formula he 
derived for the reduction of serial correlation between the numbers. 
Subsequently, Greenberger introduced a correction term to be 

applied to Coveyou 's expression and demonstrated that even though serial 
correlation might be optimized, nonetheless, the generator might suffer 
from other shortcomings, such as producing long sequences of low numbers. 

3. Discussion of Equation (1-3) 

Rotenberg [bj tested a sequence 

Yi.+lELYj,+ C (mod 2^) (A-h) 

where L and C are odd integers less than P and 0 Yr:^ 2^. In this 
case, a maximum period of 2P is obtained for any 

L - 1 (mod h) (A-5) 

An L of the form (2^ + 1) with A — 2 satisfies the condition of (A-5) and 
was used by Rotenberg in his tests. He examined two values of C, 

C = 1 and C * (.788) 2 , the latter value resulted from Coveyou' s approxi- 
mation mentioned above. Rotenberg reported that there were no s:' gnificant 
differences in his results using either one of these C values. 
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For the GDC I60I4, this generator becomes 

Yr+i- (2A+l)Yr+C (mod 2^"^) (A-6) 

This type has an edge on form (A-2) in that it requires one less add 
instructor for each new number. It also has a longer period although 
this does not seem too significant in that will produce 2^^ 

distinct elements. 

U. Sub Periods. 

It has been pointed out by Peach 16 that in the case of (A-6) 
certain harmonics exist throughout its period. These tend, he feels, 
to lend a greater stability to the generated numbers than should be ex- 
pected of a random sample. He illustrates his point by making frequency 
tests on large samples and notes that they appear to be unduly uniform, 
although his results do not overwhelmingly indicate this. '!'Jhether too 
much stability is objectionable depends on the particular application. 

If a user wanted to estimate an unknown mean without bias then this 
characteristic might not be critical. If the variance is a point of 
interest too much stability could be critical. 

5. An Example. 

To illustrate the multiplicative congruential method consider 



equations .(A-ia) which with L = = 0, C = b, P = 3 becomes 



'fr.l = 5 «r * 



(mod 2 ^) 

The sequence is determined as follows. 
Substituting Y^ = 0 into equation (A-7) gives 

Yt = 5 (mod 2^) 



(A-7^ 
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This means that is the remainder after dividing 5 by 2 
= 5. Substituting this back into (A-7 ) gives: 

Y 2 =5 I 6 ) (mod 2 ' ) 

Dividing 30 by 2^ gives a remainder 6 . Therefore Y 2 = 6 
repeated giving a sequence Y] = Y 2 = 0 , Y^ = 3 , Yj^ 
Y 5 =2, Yy = 7, Yp = 0. of period 2^. 



. This gives 



This is 
= = 1 . 
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i 

ii 



■? Nil? • 
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APPENDIX B 



PROGRANJ/lING THE GEiffiRATOR 



Method 

1. The Relation, 



Y. 



r + 1 



(2^^ Yj, (mod 2^^) 



Y = 2 
^o 



li6 - i 



can be programraed for the GDC l6oii as follows: 

(a) Define: MASK = iiOOOOOOOOOODGOOO Binary 

MASK2 = 2000GOOOOOOOGOOO Binarj' 




MACHIN'E LANGUAGE ROUTIfJE: 
ENQ (0) 

LDA (X^) 

LLS (19) 

SCL (I'iASK) 

ADD (Xq) 

ADD (Xq) 

ADD (X^) 

3TA (X„) 



(b) At this point, the random number is in the accumulatoi- and has 
been stored in in preparation for the next number if required. It 
is now required to convert it from its present integer form to a float- 
ing point fraction on the unit interval. Tliis is done by continuing on 
with the following instructions: 
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ARo (11) 

ADD (HASK2) 
FAD (MASK2) 



At this tine, the desired uniformly distributed o^l nuraher is in 
the A- register. The time of generation is about 90 micro-seconds per 
number , 



Uo 
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Thesis 5689 ; 

B2388 Barron 

c.l Random number gener- 

ation on the CDC 1604. 



